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Abstract 

The top quark sector is almost decoupled from lighter quark genera- 
tions due to the fact that Vtb ~ 1- The current experimental measure- 
ments of Va are compatible with the Standard Model expectations but 
are still dominated by experimental uncertainties. In this manuscript, a 
revision of the experimental methods used to measure Vtb is given, and a 
simple method to probe heavy flavor content fraction of top quark events, 
R = B(t — > Wb)/B(t — > Wq), is presented and discussed. Prospects for 
the measurements at the Large Hadron Collider based on generator level 
simulations are outlined. 

1 Introduction 

One of the yet unveiled mysteries in nature is the mechanism that gives mass to 
quarks, leptons and gauge bosons breaking the electroweak symmetry. The top 
quark, the heaviest elementary fermion observed, constitutes per se a puzzle in 
this context not only due to its large mass (close to a Yb atom) but also due 
to the fact that it is practically decoupled from the lighter quark generations 
decaying almost solely into the t — > Wb channel. The study of top quark events 
and its properties is therefore potentially an interesting topic and one of the last 
barriers to cross in the search for physics beyond the Standard Model (SM) of 
interactions. 

In the context of the SM and at hadron colliders, such as the Tevatron or the 
Large Hadron Collider (LHC), top quarks are produced through the interaction 
of quarks and gluons, either in pairs or as single Top production (through the 
s— , t—, tW— channels). While ti pairs are mostly created from pure QCD 
processes, single top quark production is more often produced from electroweak 
processes and its production cross section is directly proportional to the (square 
of the) CKM matrix elements related to the top quark, i.e. Vt q . Therefore, 
the measurement of single Top cross section in the different channels provides 
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a direct measurement of the CKM matrix elements. However, results from 
single Top production are still constrained mainly due to the limited number of 
events (at the Tevatron, the single Top cross section is ^2.8 pb, while ti pair 
production has a cross section of ^7.5 pb) and to the larger relative background 
contribution. In practice, single top quark discovery has only been reported 
recently. The value of Vtb is measured with an 8% relative uncertainty, assuming 
that Vtsi Vtd *C Vtb @ E]- A more accurate determination of the separate 
contributions of the different channels is expected to be feasible at the LHC, 
and the sensitivity to the tW— channel will be in the reach of the experiments for 
the first time. A direct determination of the t— channel cross section is expected 
to yield a measurement of V t b with an overall uncertainty of 5% after 10 fb _1 [I]. 
Top quark pairs may provide an indirect measurement of Vtb, through the flavor 
determination of the Top quark decay products. In particular, the ratio of 
branching fractions of the the top quark 

R= B{t ^ Wh) (1) 
£>(t -> Wq) 

1 

can be measured in the data and, by assuming the unitarity of the CKM 
matrix elements, one can infer \Vtb\ 2 from Eq. [ljand derive Vtb — VR- This in- 
direct measurement has been explored at the Tevatron [5,6. The measurement 
using ti pair events profits from the larger production cross section (~7.5 pb at 
the Tevatron and ^908 pb for the LHC at y / s=14 TeV) and from the distinctive 
ti experimental signature with a much reduced relative background contamina- 
tion. 

Results always suffer however from the systematic uncertainties related to: 
i) the imperfect knowledge of the efficiency of the algorithms used to identify the 
flavor of jets ( "b-tagging" ) , and to ii) the background modeling. In the current 
Tevatron results, R is determined with a relative uncertainty of [5] . Notice 
that R w 1 is a direct indication of the strong hierarchy between the top quark 
sector and the lighter quark generations. However, R as 1 can still be compatible 
with Vtb < 1 if the structure of the CKM matrix is different from the one 
assumed in the SM (e.g. if a fourth family of quarks exists or if the top quark 
sector couples to a massive vector like quark t'). The current measurements of R 
and Vtb have not yet reached the required precision to discard such hypotheses. 
Therefore the measurement of both R and the deviations in the heavy flavor 
content of ti events are particularly important to search for evidence of New 
Physics at the LHC. A brief summary of the SM experimental signature of the 
top quark events and examples of how the ti sample can be contaminated by 
non-SM processes are given in Sec. [2| The discussion of the method and how 
the measurement of R and of the cross section can be performed at the LHC 
with early data is discussed in Sec. [3] Conclusions are summarized in Sec. |4| 



2 Top quark decays: Standard Model and be- 
yond 

Due to their large mass {m t > raw + top quarks decay promptly without 
hadronizing, t — > Wb. Other decays, such as t — > Ws and t — > Wd are sup- 
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pressed in the context of the SM, approximately by a factor of 10~ 3 -j- 10~ 4 . 
In the prompt decay scenario, the polarization of the top quark is transferred 
to the decay products and three final polarization states can be expected. As 
mj < mj, it is possible to assume in first approximation that the bottom quark 
behaves almost as a massless particle and that it can mostly acquire left-hand 
polarization, and the polarization of the top quark is transferred to the W ac- 
cording to the ratio mw/^t- ln this approximation, right-polarized W states 
are highly suppressed and the transversal polarization states are dominant. 

The W boson decay channels determine the experimental signature of a ti 
event: "fully hadronic", "lepton+jets" , or "dilepton" events accompanied by 
the presence of two b jets are therefore expected. In hadron collisions, ti pairs 
are also accompanied by an underlying event which is the result of the proton 
remnants and by extra initial and final state radiation (ISR/FSR) jets. 

Among all, the dilepton channel has a particularly clean final state with 
two oppositely charged prompt and isolated leptons with large pr, at least two 
jets from 6-decays and large missing transverse energy from the neutrinos. The 
branching fraction of the ti dilepton channel is B[ti — > (lvib){lvib)] — 0.065±0.001 
(I — e, (j,) [7J. SM processes that can mimic the dilepton channel involve mainly 
the production of Z and W bosons, either as single particles or in pairs (i.e. di- 
boson production). For instance the same-flavor dilepton channels (i.e. ee and 
H/i) can be mimicked by Drell-Yan processes, mainly from Z boson production, 
that can be distinguished from ti events by excluding dileptons with a mass 
close to the Z boson. Single top production, in particular the tW channel can 
also mimic the dilepton channel final state, but with a much lower cross section. 
QCD events, although expected to be the dominant production at a hadron 
collider, are not expected to produce isolated, high-pr leptons. Therefore the 
possible contamination from this processes is expected to be small, and mostly 
due to instrumental and misreconstruction effects. In the dilepton channel the 
presence of at least two neutrinos accounts for an experimental transverse en- 
ergy imbalance in the detector which can possibly discriminate further ti events 
against background. 

Processes not contemplated in the SM may also contaminate the ti sample. 
Searching for deviations of the basic properties of ti events, as they are predicted 
by the SM, is therefore an important topic of investigation at the LHC. Mea- 
suring the production cross section, the mass, width and branching fractions of 
the top quark, the polarization of the final state particles and the jet content of 
the events are some examples of these studies. In this manuscript an emphasis 
is put on the search for deviations in the jet flavor content of the data sample. 

Some examples which may alter the heavy flavor content of the ti sample 
can be found in the literature: 

• associated Higgs boson production, decaying to a bb pair (i.e. tiH — > tibb). 
Although the cross section is expected to be small, it may be enhanced by 
non-SM processes such as G — > t'i — » tiH where G is a heavy color octet 
boson and t' is a vector- like fermion At the time this manuscript is 
written a lower mass limit of 311 GeV/c 2 has been set for t' by the CDF 
experiment with 2.8 of data [5]; 

• if a 4 th generation of quarks exists, then decay cascades containing top 
quarks could be produced. These events are expected to have either extra 
jets or additional leptons from the decays of multiple W bosons. Results 



3 



from the CDF experiment set a lower limit for my >325 GeV/c 2 with 
2.7 fb- 1 of data [TO]; 

• in the context of a supersymmetric theory, the mixing of chiral and scalar 
states could be privileged in the top quark sector due to its large mass. If 
X° is assumed as the lightest supersymmetric particle and the stop mass 
is (m~+ + mj,) < m t ~ < m t , then gg — > ttqq (or tt) is allowed with the stop 

decaying further in t — > tx° is expected to have the typical signature of a tt 
pair plus additional $t due to undetected neutral particles, and possibly 
larger lepton/jet multiplicities. Limits on stop mass were obtained by the 
CDF experiment with 2.7 fb -1 of data for different values of B(xt 
Xilvi) and Xi masses [TT]; 

• the production of a charged Higgs from a top quark decay can also be ex- 
pected in the context of supersymmetry if nit > mH+ + m f>- This scenario 
would be reflected by an increase in the production of the r dilepton chan- 
nel with respect to the other channels from H + — > tv t decays. However, 
in the case of leptophobic H + an increase of charm and strange particles 
from the production of cs pairs could also be expected. With 1.0 fb -1 
the DO collaboration performs a model-independent measurement which 
excludes B{t ->■ H+b) > 0.12-0.26 at 95% C.L. depending on Aig+P]. 

Using a different approach from the various phenomenological scenarios just 
briefly reviewed, a generic model-independent search aimed at the study of the 
heavy flavor content of tt events is proposed in the following [1] . If measurements 
in the data are compatible with the SM expectations, V t b can be determined 
under the particular assumption on the structure of the CKM matrix. 

3 Probing the heavy flavor in tt events 

Decay products of top quark events are expected to contain 6-jets, resulting 
either from the top quarks or other heavy particle decays as reviewed earlier. 
Experimentally, the flavor of a jet can be identified by taking advantage of the 
properties of the hadrons containing b quarks. 

B hadrons are long-lived with proper-lifetimes typically of the order of 
ctb ~ 450 [ira. After the decay of long-lived hadrons into charged particles, 
a secondary vertex of tracks associated to a jet might be found. In «20% of 
the cases these jets can also contain an electron or a muon due to the semi- 
leptonic branching ratio of the B mesons. Therefore, jets produced by heavy 
flavor quarks can be identified by displaced tracks or secondary charged leptons. 
The "6-tagging" algorithms exploit these simple properties to build discrimina- 
tor variables which can be used to identify the flavor of a jet. Each algorithm is 
usually characterized by an efficiency of identifying b jets (ef,) and a "mis-tag" 
rate due to mis-identification of light quark and gluon initiated jets (e q ) [13j . 
Experimentally, Sb can be measured in QCD events with reconstructed jets con- 
taining muons. These events are most likely 6-jet enriched events (i.e. from 
bb) and allow one to evaluate the 6-tagging efficiency using a "tag and probe" 
method. The mistag rate can also be estimated from data using the fact that 
for light quark and gluon-initiated jets the tracks are distributed randomly in 
the vicinity of the primary vertex of the hard interaction. Therefore, the tracks 
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produced upstream with respect to the primary vertex of the hard interaction 
can be used to extrapolate the contamination in the 6-tagging region. Both mea- 
surements are sensitive to the alignment of the tracker detector, to the track 
reconstruction efficiency and resolution of the track impact parameter and to 
the resolution of the vertices. 

By using the b tagging algorithms just described, a method can be developed 
to probe flavor deviations in the decays of ti pairs. Generator level simulations 
are used in the following to illustrate how to apply the method to the data. 



3.1 Description of the method 

Evidence for the heavy flavor content of the ti final states can be obtained by 
counting the 6-tag multiplicity found in the selected events. The number of b- 
tags counted in the events is proportional to the fraction of b jets present in the 
sample, and is related to Sb and e q . Ultimately, the number of b jets produced 
depends on the ratio of branching fractions of the top quark to bottom quarks 
(Eq. [I]) . In particular, a simple selection of tt dilepton events can be devised 
where the final sample is dominated by top quark events over the background 
(S/B 16). An example of the 6-tag multiplicity distribution of ti dilepton 
events is shown in Fig. [T] (left). The distribution is obtained from Monte Carlo 
(MC) using typical values for Sb and e 9 , randomly thrown to assign each jet 
a 6-tag. The two 6-tag multiplicity bin clearly dominates, as expected in a 
it-dominated sample where two jets are from heavy flavor jets. 
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Figure f : (Left) 6-tagging multiplicity obtained assuming a jet is 6-tagged as- 
suming £h=80% and e q —15%. (Right) Probability model for the observation of 
k 6-tags in events with two jets, as function of R. 

Although in the dilepton channel small contributions are expected from back- 
ground processes, there is a non-negligible probability that at least one jet from 
a ti decay is either missed because it was not reconstructed or because it did 
not pass the jet selection criteria. Another jet that may be selected instead 
could, for example, be a jet initiated from ISR or FSR. This will be referred to 
as "jet misassignment" and its determination from data is discussed in detail 
in Sec. |3.3| The b-tag multiplicity distribution reflects also the fraction of jet 
misassignment in the sample, namely the fractions of events with: 
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• no jet from the top decays selected (background-dominated); 

• only one jet correctly assigned to a top decay (combination of signal and 
background); 

• two jets correctly assigned to the top decays (signal-dominated). 

The relative contributions of these three classes of events can be expressed 
by the weights Qj, where J2i a i = 1 an d i = 0, 1, 2 is the number of jets from top 
decays correctly reconstructed and selected. For example, a-i is the probability 
that both jets are correctly assigned to b jets. 

Using R, £;,, e q and the Qj (as previously defined), it is possible to model 
probabilistically the average 6-tagging multiplicity. The number of expected 
events with k tagged jets can be generically written as follows: 

all jets 

N ev (kb-tags)= N ev(n jets) ■ P^lf ags (2) 

n jets— 2 

where N ev (n jets) is the number of observed events with n-jets, and P^ jj! t s ags 
is the probability to count k b-tags in a n-jet event. The probability function 
encloses the available knowledge on R, e q and o^. In order to illustrate 
explicitly the construction of such probability functions, the exclusive two-jet 
multiplicity bin is used and the following expression is obtained: 

2 

p 2 jets _ \^ p i | 2 jets 

k b-tags — ' r k b-tags \°) 

i jets— 
from top decay 

where the a, parameters describe the composition of the sample in terms of 
events with z-jets from top quark decays correctly reconstructed and selected, 
and Pk b _ t ag S tS is the probability that k 6-tags are observed in an event with two 
jets of which i jets come from ti decays. Extra jets can be reconstructed and 
selected in the selected events mainly coming from Initial/Final state radiation 
produced along with the tt pair. This extra jets can mimic ti events even if 
not coming directly from top decays. The previous expression can therefore be 
further developed and, in particular for the case where two tagged jets are found 
in a 2-jet event, the probability can be explicitly written as: 



P 2 j,_t gg* S = £q if no jets are from tt decays 

P 2 \L 2 t J g T = 2R 2 e b £q + 2R(1 - R)(e b + e q )e q + 2(1 - R) 2 e 2 q if 1 jet is from ti decays 

P 2b-tagT = R2£ b + 2i? (! - R)Sb£q + (1 - R)' ^ if 2 jets are from ti decays 

(4) 

So far, the probability model has been derived for the most relevant case 
which is the observation of two 6-tags in the 2-jet multiplicity bin. The model 
can be expanded to a more general situation but it is omitted from this manuscript 
for the sake of simplicity. In this probability model it is assumed that: 
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1. the correlation between the probability of observing single and double tags 
is negligible; 

2. for higher jet multiplicity bins the probability can be computed from the 
product of P£ l^t ags with the probability of observing the extra tags from 
the extra radiation jets; 

3. the mistag rate e q is an effective measurement of the probability of "tag- 
ging" light quark, gluon and charmed jets in the dilepton sample. In first 
approach, it is taken as the mistag rate for light quark and gluon jets 
but can also be derived directly from data as it will be shown in the next 
Section. 

By summing up the contributions from events with 2, 1 or no correctly 
assigned jets (weighted by 014), as expressed in Eq. [3j one builds the probability 
model shown in Fig. [I] {right). 

Using the generic Eq. [2j R can be fit from the 6-tag multiplicity distributions 
by maximizing the likelihood function: 

all jets 

C(R, £b, s q , a>i) = Y\ Voisson[N ev (k b — tags), N ev (k b — tags)] (5) 

fe=0 

where N ev (k b — tags) is the number of observed events with k b— tags. 
Equation [5] can be used to perform the fit using different choices and sub- 
samples of the selected events. The results obtained will be discussed in more 
detail by applying the model to a MC generated sample. 

3.2 Dilepton sample selection 

In order to illustrate the potential of the method described, a dilepton sample 
was generated using MadGraph [T3] and Pythia [T5] generators with R = 1. 
Inclusive tt events and the most relevant background processes: single top, 
W/Z+jets, WW, WZ, ZZ and QCD were generated. The events are selected 
mimicking a realistic dilepton event selection from reconstructed events as the 
ones proposed by the ATLAS and CMS collaborations [T()I ITTHTg] . 

A simple event selection requires the presence of two high-p^ isolated leptons, 
either electron or muon, at least two jets, and large missing transverse energy. 
This simple selection of the tt dilepton channel yields a clean sample with S/B « 
16 For the study presented here it is expected to collect 650-700 events, 
whereas only 6% of the events are from background sources when an integrated 
luminosity of 100 pb _1 at y/s=10 TeV is considered. The exact number of the 
events selected is not relevant to the results of the method presented here, as it 
will become clear. 

In the selected samples (signal and background), the number of events with 
2, 1 or jets from top quark decays correctly reconstructed and selected can be 

1 For proton-proton collisions at a center of mass energy of 7 TeV it can be inferred from 
the re-scaling of signal and background cross sections obtained with the Pythia generator that 
the S/B figure of merit should decrease by approximately 30%. It must be noted that cuts 
have been gauged for 10 TeV collisions and optimization of the S/B can be performed for 7 
TeV collision. 
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counted using MC truth. Table [T] shows the fraction of events with 2, 1 or jets 



(i.e. the values for the on introduced in Sec. 3.1). In first approximation, the oti 
can be parameterized as a binomial combination of a, the probability that a jet 
from a top quark decay is reconstructed and selected. The probability to have 
two jets from tt decays reconstructed and correctly assigned (independently of 
its flavor) is therefore given by a.2 = a 2 . Conversely, the probability that both 
jets from the top decay are missed in the selected event is given by «o = (1 — a) 2 - 
One jet from top decay is expected to be selected with a probability given by 
Q'i = 2(1 — a)a. The values for a are also shown in Tab. [ljand were obtained 
by fitting a binomial to the aj. 



Table 1: Values of a, for the selected events computed using MC truth. Different 
dilepton channels are shown The last column shows the value of a that fits best 
the ccj with a binomial function (see text). Only statistical uncertainties are 
shown. 



Dilepton channel 


a 2 


Oil 


a 


a 


e/i 


0.675 ± 0.042 


0.295 ± 0.024 


0.030 ± 0.005 


0.824 ± 0.009 


ee/fifi 


0.635 ± 0.041 


0.288 ± 0.023 


0.077 ± 0.009 


0.78 ± 0.01 



How can the correct values of oli be determined? It will be explained in the 
following. 

3.3 Jet misassignment estimate from data 

The OLi values can be estimated directly from data using the kinematic properties 
of the events. The method presented suggests to minimally rely on the MC 
simulation in order to minimize the uncertainty due to the MC model. 

The lepton-jet pairs originating from the same top quark decay are kine- 
matically correlated. The distribution of the angle between the lepton and jet 
directions of motion in the W^-boson rest frame, and the lepton-jet invariant 
mass are often pointed out as possible variables which depict experimentally 
the existing correlation between particles produced from the same top decay 
cascade |19j . The lepton-jet invariant mass shape is Jacobian with a kinemati- 
cal end-point at Mfl ax = \Jm 2 — m 2 ^ «156 GeV/c 2 after which no lepton-jet 
pair from tt decays is observed. On the other hand, the invariant mass of mis- 
assigned lepton-jet pairs exhibits a longer tail towards high mass values. The 
shape of these distributions depend on the tt pair kinematics and on the con- 
tamination from ISR/FSR jets in the sample. Two methods are proposed to 
model the invariant mass distribution of the mis-assigned lepton-jet pairs (i.e. 
background model): 

• "random rotation" of the momentum of the selected leptons, p — > 
pe(9,(j)); 

• "swap" the jet in the assigned lepton-jet pair, with a jet from a different 
event. 
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In practice these two methods yield similar invariant mass spectra and 
their average is used, which will be referred to as the combinatorial "mis- 
assignment" model. Figure [2] (left) shows the invariant mass distribution of 
the mis-assignment model after it has been rescaled in order to reproduce the 
tail of the inclusive lepton-jet spectrum for masses Mij >190 GeV/c 2 . In the 
"data" sample, each lepton (electron or muon) is paired with all selected jets and 
the invariant mass is computed for all pairs. The distribution of the correctly 
assigned lepton-jet pairs, derived from MC truth information, is also shown. 
The correct assignment distribution together with the mis-assignment model 
sums up correctly (within uncertainties) reproducing the inclusive lepton-jet in- 
variant mass spectrum ("data" points). The mis-assignment model describes 
well the expected (true) lepton-jet mis-assignment distribution in most of the 
mass region of the spectrum (Mij > 40 GeV/c 2 ). The low mass region of the 
spectrum (Mij < 40 GeV/c 2 ), which is due to almost collinear lepton-jet pairs, 
is expected to be suppressed in real data because the leptons are required to be 
isolated and the jets to be separated from the selected leptons [TBI I18j . 

Figure [2] (right) shows the distributions of the lepton-jet invariant mass pairs 
after the subtraction of the mis-assignment model. Good agreement with the 
correct assignment distribution (obtained using MC truth) is found in the high 
mass region (-My > 190 GeV/c 2 ) where the mis-assignments dominate and no 
signal lepton-jet pairs are expected. 




Figure 2: (Left) Inclusive lepton-jet invariant mass distribution. The spectrum 
obtained from the mis-assignment model described in the text is shown together 
with the distribution of the correctly (using MC truth) assigned lepton-jet pairs. 
(Right) Invariant mass distribution of the lepton-jet pairs after the background 
is subtracted. 

The contribution of jet mis-assignment (i.e. background) can be fit directly 
from the mis-assignment model by computing the ratio of areas below the Mij 
distributions, in "data" and in the mis-assignment model" (Fig. [2j left). The 
ratio is expected to flatten out in the region dominated by the background 
(typically Mij >190 GeV/c 2 ) and it provides the normalization factor of the 
background. If: 

• n(Mij) is the distribution of all lepton-jet pairs as a function of the pair 
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mass (total number of pairs is N); 



• n m i S (Mi t j) is the distribution of only mis-assigned pairs (total number of 
mis-assignments is N m i S ), 

then, for M[ j > z, it is assumed that all the pairs are mis-assigned. 

N%?>' = N M «>' (6) 

The fraction-/ of background pairs with Mi j > z can be used to estimate 
the total number of mis-assignments, N rnis . If N^'^ z is the number of back- 
ground pairs predicted by the background model with Mij > z and N mo( iei is 
normalized to the total number of pairs, then it is assumed that: 

N M U> Z jy M f,j> z 
£ mis model f*?\ 

AT AT 
Jv mis J * model 

Using the previous result, one can normalize the background model to the 
signal depleted range (i.e.Mij > z) Then, the fraction of correctly assigned 
lepton-jet pairs is given by: 

M, j >z 

2 N evts 2 N evts 2 N evts { N Ml -> z / 

x ™^del ' 



mo< 



The value of a, estimated by applying Eq. [8] to the lepton-jet invariant mass 
spectrum, yields a = 0.82 ± 0.03 for the "e/j," sample, and a — 0.80 ± 0.03 for 
the "ee and fifj," sample, in agreement with the MC-expected values of Tab. [l] 
(right column), within uncertainties. 

The robustness of the method described for the determination of a can be 
tested by artificially modifying some of the parameters. The selected events 
are re-weighted depending on the number of jets from top decays, after recon- 
struction and selection. The resulting distribution of the difference between the 
value obtained from "data" and the true generated value, i.e. (a — otMc), is 
shown as a function of auc m Fig- [3] The results are compatible with zero bias, 
meaning the method is able to reproduce the correct number of reconstructed 
jets from top quark decays. Variation of other parameters which are affected by 
experimental uncertainties, such as lepton and jet energy scales, or which are 
intrinsic to the method itself, such as the mass cut used to determine the nor- 
malization of the misassignment model, are are listed in Tab. [2j Overall, these 
variations account for <0.02% in the determination of a. One can therefore 
conclude that the determination of a is expected to robust and to be dominated 
by the statistical uncertainty. 

3.4 Measurement of R 

The measurement of R can be performed by fitting the 6-tagging multiplicity 
distribution (Fig. [T]) using the probability model (Eq. [5| : R is let to float freely 
without any constraint, and and e q are taken as inputs. The ct^ are measured 
from the "data" as described in Sec. 3.3 Figure [2] {left) shows the results 



obtained by fitting R to the 6-tagging multiplicity distribution for ee/e/z and e/i 
events, separately. The result is statistically compatible with MC expectations: 
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Figure 3: Deviation of the value of a derived from "data" with respect to the 
MC value, a gen (see text). 



Table 2: Deviations in a from the variation (shift) of different parameters. The 
determination of a was repeated after varying independently each parameter 
presented in the left column of the table (see text). Variations are presented 
separately for each channel. 



Parameter 


Channel 

e/j, ee/nn 


mass cut (5%) 


+0.005 
-0.01 


+0.006 

-0.01 


JES (10%) 


+0.01 
-0.01 


+0.01 

-0.01 


LES (1%) 


+0.006 
-0.006 


+0.009 
-0.009 


Total systematic (quad sum) 


+0.01 
-0.02 


+U.0I 
-0.02 
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R = 1.00 ± 0.03 (1.02 ± 0.04) for the e/i (ee/fifi) sample. By performing 
several pseudo-experiments and repeating the fit to the 6-tagging multiplicity 
distributions one obtains a good agreement with the MC truth, i.e. 5R =< 
R > -R MC = -0.006 ± 0.04 ( -0.008± 0.04 ) for the (ee/w) sample. The 



procedure has been also tested on pseudo-experiments where R 



MC 



< 1. The 



fit converges correctly to the generated value with similar uncertainties as the 
ones quoted above. 

The method can be easily extended to the simultaneous measurement of R 
and other quantities, such as ao (i.e. the contribution from events where no 
jet from top quark decays has been reconstructed and selected), e q (i.e. the 
mistagging efficiency), or the tt cross section which is discussed in more detail 
in Sec. 
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Such fits can be used as a cross check for the value of ao or e q 
used as inputs for the measurement of R, or in the latter case to perform a 
simultaneous measurement of two variables. It is important to notice that the 
convergence of the fit is mostly driven by the experimental knowledge of Sb- 
As R and Et, are correlated it is not possible to measure simultaneously these 
quantities from the same sample. Figure [4] (right) shows the contour plots of R 
and £(,. This contour plot can also be used to cross check the ^-tagging efficiency 
used as input to the fit. 
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Figure 4: (Left) Fit results for R. (Right) Contour plots (la, 2a and 3a) for the 
likelihood obtained by floating R and £(,. In the MC, R =1, £b =0.80, e q =0.15 
for both plots. 



The robustness of the method used to determine R is dominated by the 
uncertainty on the knowledge of the ^-tagging efficiency. With the first data 
acquired and the events selected for this measurement, only an average value 
for ejj can be used due to the limited number of events. When more data will 
be available, the jet pt and 77 dependence of £{, can be used. The measurement 
of R is also affected by the variation in e q and a. The uncertainty in a is 



affected by the factors discussed in Sec. 3.3 These uncertainties are small when 



compared to the uncertainty from the 6-tagging efficiency. Variation of ISR/FSR 
contamination of the sample does not affect much the final result of the fit as 
these variations are correctly described when a is estimated from data. Table [3] 
summarizes how R is expected to be affected from the variation of the different 
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fit parameters. 



Table 3: Variation of R due to the variation of the different fit parameters. £& 
is varied by 10% and e q is varied by 5%. All other parameters are varied as 



discussed previously in Sec. 3.3 



Parameter 




ee/fifi 


e b (10%) 


+0.10 
-0.10 


+0.10 
-0.10 


e q (3%) 


+0.02 
-0.01 


+0.02 
-0.01 


mass cut for a (5%) 


+0.03 

-0.02 


+0.03 
-0.02 


JES (10%) 


+0.02 

-0.02 


+0.02 
-0.02 


LES (1%) 


+0.005 

-0.005 


+0.005 
-0.005 


Total systematic 


±0.1 


±0.1 



The dominant uncertainty is due to eb (10%) which is taken in the early 
data-taking phase as an external input, and it is estimated from other physics 
processes. The uncertainty in the measurement of R is comparable to the reach 
of the Tevatron experiments at the time this manuscript was written. With more 
data it is reasonable to expect this uncertainty to decrease significantly [TH1 [20] • 
Notice that the parameters affect the different dilcpton channels in the same way. 
This feature is particularly important because it allows one to cancel out some 
systematic uncertainties by taking the ratio of the measurements obtained in the 
different channels. Combined measurements of R can therefore be performed 
with the purpose of reducing the systematic uncertainties. A procedure for 
combined measurements is proposed in the next Section. 

3.5 Combination of results in different channels 

The experimental uncertainty in the 6-tagging efficiency is the dominant factor 
affecting the measurement of R. However this uncertainty can be controlled 
by combining the measurements of R obtained from different channels. The 
procedure can be in fact generalized. For instance, the uncertainty in a can 
be controlled by combining the measurements of R obtained from different jet 
multiplicity bins in the same tt channel. With early data an iterative procedure 
can therefore be adopted taking advantage of different tt channels or control 
regions of the sample in order to control the systematic uncertainties affecting 
this measurement. Generically one can: 

• divide the sample in two regions: A and B - e.g. two different channels or 
two different jet multiplicity bins; 

• fit the 6-tag multiplicity distributions: 

— in sample A, assume R = 1 and leave the parameter x = sj,, a, ... to 
float freely in the fit; 

— in sample B, assume an initial estimate for parameter x — £j,,a, ... 
and leave R to float freely in the fit; 

• the fit yields a value of x for sample A and a value of R for sample B; 
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• repeat the fit giving as input R(B)/x(A) to sample A/B; 

• if the value R(B)/x(A) maximizes the likelihood in sample A/B it is kept 
otherwise the previous value is maintained; 

• the fit is repeated until no variation is observed in the values of R and x 
obtained from each sample. 

As an example, such procedure is applied to region A ("ee//i/i" sample) and 
region B ("e/x" sample). The 6-tagging efficiency is measured in region A, and 
R is measured in region B. As e b and R are fully correlated, a Gaussian term is 
added to the likelihood presented in [5] as follows: 

all jets 

C{R 

) = IT ~Poisson[N ev (kb-tags),N ev (kb-tags)}xGaussian(e b ,e b , 

(9) 

where e b is the experimentally measured value of e b in a b jet enriched sam- 
ple (see Sec. ||| and a £b is the corresponding uncertainty. This Gaussian extra 
term makes the fit more robust, as it takes into account the proper uncer- 
tainty associated to e b . The final result obtained is compatible with MC truth: 
Se b =< e b > -Ef G =0.005 ± 0.01 and SR =< R > -R MC =-0.004 ± 0.02. 
Table [4] summarizes how the variation of the different parameters affects the 
measurement of e b and R in the different channels. The initial uncertainty on 
the knowledge of e b is reduced by performing the proposed method. This result 
is particularly interesting given the fact that throughout this manuscript we 
have considered a sample corresponding to 100 pb _1 . Increased statistics and 
the knowledge of the detectors, together with additional inputs from other tt 
channels will certainly reduce the total uncertainty in the measurement of this 
quantity. 

Table 4: Variations of R and e b in the e/i and ee//x/x channels when an iterative 
procedure is used to measure this quantities simultaneously (sec text). 



Sample 


e/i 


ee/ /i/i 


Parameter 


R 


£fc 


e b (10%) 


+0.UU4 
-0.003 


+0.003 
-0.003 


e q (3%) 


+0.003 
-0.001 


+0.005 
-0.005 


mass cut for a (5%) 


+0.004 
-0.004 


+0.001 
-0.001 


JES (10%) 


+0.003 
-0.003 


+0.005 
-0.005 


LES (1%) 


+0.0007 
-0.001 


+0.003 
-0.004 


Total systematic 


+0.007 
-0.006 


+0.008 
-0.009 



3.6 Cross section measurement 

As stated in Sec. |3.4| it is possible to extend the method of measuring R to 
incorporate a simultaneous measurement of the cross section. The production 
cross section can be determined experimentally from the following expression: 
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tt dilcptons 



.4 



e 



L 



(10) 



where N t f dilcptons is the total number of ti dilepton events above the back- 
ground, A is the acceptance of the ti dilepton channel, e combines trigger, recon- 
struction and selection efficiencies, and L is the total integrated luminosity. The 
determination of both A, e and L is beyond the scope of this manuscript and it 
is therefore assumed in what follows. Details can be looked elsewhere [TBI I18j . 
On the other hand N t i diieptons can be computed using the quantities found in 
the previous sections, and can be parametrized as follows: 



Ntt diieptons — (1 — O-O) ' Advents — -^single top — -^othcr tt 



(11) 



where an, is the probability that no jets from top decays have been recon- 
structed and selected in the final sample. By combining Eq. [T0| and Eq. [TT] it is 
trivial to parametrize ctQ as a function of a. It is therefore possible to use this 
parametrization as an input to the likelihood function in Eq. [5] and to leave both 
R and a to float freely when fitting the ^-tagging multiplicity distribution. The 
result of this combined fit is shown in Fig. [5] and it is compared with the MC in- 
put given for both R and a. The result is shown in a detector-independent way, 
i.e., factorizing the acceptance and efficiencies in the measurement of a. A good 
agreement is found between the results of the method and the MC expectations, 
within the statistical uncertainty of the sample used for this study. 
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Figure 5: Combined fit results for R and a using the full dilepton sample. 
Contour plots (la, 2a and 3a) for the likelihood obtained by floating R and a 
are shown. In the MC, R =1, a =414 pb. 



The discussion on the robustness of the combined measurement is similar 



to the discussions carried through Sees. |3.3| and |3.4| A more detailed study on 
the final uncertainty on the measurement of the cross section would, however, 
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require full detector simulation and estimation on the acceptance and efficiency 
of detecting ti dilepton events which is out of the scope of this manuscript. 

4 Conclusions 

The physics of the top quark sector is one of the current topics of great interest to 
study at the LHC. The scale of the mass of the top quark is set at approximately 
the electroweak symmetry breaking scale, and according to several BSM models 
the preferential coupling may mainly occur through the top quark sector. In 
this context, the accurate measurement of the top quark properties is crucial to 
look for evidence of BSM physics at the LHC. 

In this manuscript, a discussion on the measurement of the heavy flavor 
content of ti decays was presented. A specific proposal was suggested to use a 
likelihood method to fit the 6-tagging multiplicity distribution, based on quanti- 
ties which can be measured directly from data in order to minimize the reliance 
on MC simulation. The method is expected to improve the current measurement 
of R, the ratio of branching fractions of the top quark R = B(t — »■ Wb)/B(t — > 
Wq), and it can be extended to measure simultaneously also the production 
cross section of ti events. 

For illustration, samples of ti dilepton events together with the main back- 
grounds were used. It was shown that a simple kinematical variable, such as 
the lepton-jet invariant mass, can be used to distinguish signal and background 
events. A reliable modeling of the background can be achieved without the use 
of MC samples, and can be instead directly derived from data. By using an 
iterative method, in which part of the unknown variables are measured in one 
sub-sample and another (i.e. R) is measured in the complementary sample, the 
systematic uncertainties can be controlled. Keeping in mind that a more accu- 
rate determination of the systematic uncertainty requires measurements from 
data and comparison with full detector simulation which were not used in this 
manuscript, the results here shown are, however, expected to hold. The case was 
studied for a number of ti events which is comparable to an equivalent integrated 
luminosity of C =100 pb _1 at ^=10 TeV. The measurement of R is expected 
to be mostly statistically dominated with an uncertainty of w 2%. Using the 
measurements obtained from other ti channels, namely the "lepton+jets" chan- 
nel, and in different control regions, namely different jet multiplicity bins in the 
same ti channel, the systematical uncertainties are expected to cancel out, or 
to be largely reduced. A measurement of R sensitive to deviations from the SM 
seems possible with early data at the LHC. 
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